Consonant recognition and the articulation index.

Author

  • Jont B Allen
Abstract

The purpose of this paper is to provide insight into how speech is processed by the auditory system, by quantifying the nature of nonsense speech sound confusions. (1) The Miller and Nicely [J. Acoust. Soc. Am. 27(2), 338-352 (1955)] confusion matrix (CM) data are analyzed by plotting the CM elements $S_{i,j}(\mathrm{SNR})$ as a function of the signal-to-noise ratio (SNR). This allows for the robust clustering of perceptual feature (event) groups, not robustly defined by a single CM table, where clusters depend on the sound order. (2) The SNR is then re-expressed as an articulation index (AI), and used as the independent variable. The normalized log scores $\log(1 - S_{i,i}(\mathrm{AI}))$ and $\log(S_{i,j}(\mathrm{AI}))$, $j \neq i$, then become linear functions of AI, on log-error versus AI plots. This linear dependence may be interpreted as an extension of the band-independence model of Fletcher. (3) The model formula for the average score for the finite-alphabet case, $P_c(\mathrm{AI}, H) = \sum_{i=1}^{N} S_{i,i}/N$, is then modified to include the effect of entropy $H$. Due to the grouping of sounds with increased SNR (and AI), the sound-group entropy $H_g$ plays a key role in this performance measure. (4) A parametric model for the confusions $S_{i,j}(\mathrm{AI}, H_g)$ is then described, which characterizes the confusions between competing sounds within a group.
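The quantities named in the abstract can be illustrated with a minimal sketch. It is not the paper's analysis: the 3-sound count matrix, the value of `e_min`, and all function names are hypothetical, and two things are assumed here for illustration only: that the entropy is the per-row Shannon entropy of the normalized CM, and that the error follows the form `e_min ** AI`, which makes the log error exactly linear in AI.

```python
import numpy as np

def normalize_rows(counts):
    """Turn a count confusion matrix into row-stochastic scores S[i, j]."""
    counts = np.asarray(counts, dtype=float)
    return counts / counts.sum(axis=1, keepdims=True)

def average_score(S):
    """Average score Pc = sum_i S[i, i] / N for an N-sound alphabet."""
    return float(np.trace(S) / S.shape[0])

def row_entropy_bits(S):
    """Shannon entropy of each row in bits, treating 0 * log(0) as 0.
    (Assumed definition; the abstract does not spell out H.)"""
    safe = np.where(S > 0, S, 1.0)  # log2(1) = 0 for empty cells
    return -(S * np.log2(safe)).sum(axis=1)

def fit_log_error(ai, error):
    """Least-squares line through log10(error) versus AI."""
    slope, intercept = np.polyfit(ai, np.log10(error), 1)
    return slope, intercept

# Hypothetical counts: rows are spoken sounds, columns are reported sounds.
counts = [[8, 1, 1],
          [2, 6, 2],
          [0, 0, 10]]
S = normalize_rows(counts)
pc = average_score(S)      # mean of the diagonal: (0.8 + 0.6 + 1.0) / 3
H = row_entropy_bits(S)    # the third row is never confused, so its entropy is 0

# Synthetic error curve under the assumed form e(AI) = e_min ** AI.
e_min = 0.02               # hypothetical minimum error at AI = 1
ai = np.linspace(0.1, 1.0, 10)
error = e_min ** ai
slope, intercept = fit_log_error(ai, error)  # slope ~ log10(e_min), intercept ~ 0
```

With measured data, `error` would be replaced by `1 - S[i, i]` at each AI value, and departures from a straight line on the log-error plot would show where the assumed form stops holding.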


Similar resources

Percentage of Consonants Correct for 3-5 Years Old Kurdish-Speaking Children With Middle Kurmanji-Mukryani Dialect

Objectives: The present research aims to study the normal development of Percentage of Consonants Correct (PCC) in Kurdish-speaking children, with Middle Kurmanji-Mukryani Dialect as an Articulation Competency Index (ACI). PCC was examined in terms of the manner of articulation and position of sound in the word. Methods: In this descriptive-analytical cross-sectional study, 120 Kurdish-speak...


Consonant confusions in white noise.

The classic [MN55] confusion matrix experiment (16 consonants, white noise masker) was repeated by using computerized procedures, similar to those of Phatak and Allen (2007) ["Consonant and vowel confusions in speech-weighted noise," J. Acoust. Soc. Am. 121, 2312-2316]. The consonant scores in white noise can be categorized in three sets: low-error set [/m/, /n/], average-error set [/p/, /t/, ...


Effects of vowel context on the recognition of initial and medial consonants by cochlear implant users.

OBJECTIVE: Scores on consonant-recognition tests are widely used as an index of speech-perception ability in cochlear implant (CI) users. The consonant stimuli in these tests are typically presented in the /ɑ/ vowel context, even though consonants in conversational speech occur in many other contexts. For this reason, it would be useful to know whether vowel context has any systematic effect...


Consonant and vowel confusions in speech-weighted noise

This paper presents the results of a closed-set recognition task for 64 consonant-vowel sounds (16 C × 4 V, spoken by 18 talkers) in speech-weighted noise (-22, -20, -16, -10, -2 dB) and in quiet. The confusion matrices were generated using responses of a homogeneous set of ten listeners and the confusions were analyzed using a graphical method. In speech-weighted noise the consonants separate in...


The influence of stop consonants' perceptual features on the Articulation Index model.

Studies on consonant perception under noise conditions typically describe the average consonant error as exponential in the Articulation Index (AI). While this AI formula nicely fits the average error over all consonants, it does not fit the error for any consonant at the utterance level. This study analyzes the error patterns of six stop consonants /p, t, k, b, d, g/ with four vowels (/ɑ/, /ɛ/...


Knowledge based approach to consonant recognition

This paper presents a knowledge based approach to consonant recognition. In traditional knowledge based systems, the expert is the linguist/phonetician who attempts to describe and quantify the acoustic events, in the form of production rules into phonetic description. This paper proposes to alter the expert's role so that the expert only needs to provide the basic structure of the phonetic cla...



Journal:
  • The Journal of the Acoustical Society of America

Volume 117, Issue 4 (Pt 1)

Publication date: 2005